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(54) Beta-carotene hydroxylase gene 

(57) A polypeptide which has p-carotene hydroxy- 
lase activity comprises an amino acid sequence 
selected from: 



(i) the amino acid sequence of SEQ ID NO:2; 

(ii) an amino acid sequence which is at least 70% 
homologous to the amino acid sequence of SEQ ID 
NO:2:and 

(iii) a fragment of sequence (i) or (ii) which is not a 
said sequence (ii). 
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Description 

[0001] The present invention relates to a p-carotene hydroxylase, a DNA coding for the p-carotene hydroxylase, a 
recombinant vector comprising the DNA. a transformant transformed with the vector, a method for preparing the p-car- 

5 otene hydroxylase and a method for preparing p-cryptoxanthin. 

[0002] In carotenoids synthesized by animals, plants and microorganisms, there are a group of compounds with a 
hydroxyl group(s) generically called xanthophylL These compounds are generated from carotenoids (starting sub- 
stances) by the catalytic action of hydroxylase. For example, one hydroxyl group is introduced into p-carotene to yield 
p-cryptoxanthin, Into which another hydroxyl group is introduced to yield zeaxanthin via the biosynthetic pathway shown 

10 below (see arrow (1 ) in Fig, 1 ): 

p-carotene -> p-cryptoxanthin -> zeaxanthin 

[0003] This p-cryptoxanthin is drained by introducing a hydroxyl group into one of the two ionone rings present in p- 
15 carotene. When another hydroxyl group is introduced into a position symmetric to the former position, zeaxanthin is pro- 
duced (Fig. 1). 

[0004] In a large number of plants and microorganisms, metabolism proceeds from p-carotene to zeaxanthin, produc- 
ing little p-cryptoxanthin. the intermediate into which only one hydroxyl group is introduced. 

[0005] This reaction is controlled by a hydroxylase gene called Crt Z. In this enzyme reaction, it is considered that two 
20 hydroxyl groups are introduced almost simultaneously. For example, under the control of a hydroxylase gene cloned 
from a bacterium belonging to the genus Erwinia . zeaxanthin is produced which is obtainable by introducing two 
hydroxyl groups into p-carotene. 

[0006] In Citrus yj^tJiu (Satsuma mandarine) which is a major citrus fruit in Japan, p-cryptoxanthin obtainable by 
introducing one hydroxyl group into p-carotene is considered to be one of the most important carotenoids. In particular. 
25 p-cryptoxanthin occupies 60-70% of the total carotenoid content in the edible part of this fruit. 

[0007] Considering this high p-cryptoxanthin content of Citrus unshiu . it is hard to think that the p-cryptoxanthin in 
Citrus unshiu is produced by a gene involved in the above-mentioned metabolic pathway. Also, it is still unknown 
whether p-cryptoxanthin is produced by those genes which have been already cloned. 

[0008] It is an object of the present irrvention to provide a p-carotene hydroxylase and a gene coding for the enzyme. 
30 [0009] As a result of intensive and extensive researches toward the solution of the above problem, the present inven- 
tors have succeeded in isolating from a citrus -derived cDNA library a DNA coding for a p-carotene hydroxylase. Thus, 
the present invention has been achieved. 

[001 0] Accordingly, the present invention provides a polypeptide which has p-carotene hydroxylase activity and which 
comprises an amino add sequence selected from: 

35 

(i) the amino acid sequence of SEQ ID NO: 2; 

(ii) an amino acid sequence which is at least 70% homologous to the amino acid sequence of SEQ ID NO: 2; and 

(iii) a fragment of sequence (i) or (ii) which is not a said sequence (ii). 

40 [001 1] The polypeptide is suitably a protein, typically a recombinant protein, selected from: 

(a) a protein consisting of the amino acid sequence of SEQ ID NO: 2; and 

(b) a protein which consists of the amino acid sequence of SEQ ID NO: 2 having deletion, substitution or addition 
of one or several amino acids. The present invention further relates to: 

45 

a polynucleotide, such as a DNA, coding for a polypeptide of the invention: 

a DNA coding for a p-carotene hydroxylase, comprising the nucleotide sequence of SEQ ID NO: 1 and prefer- 
ably the coding portion of that sequence; 

a recombinant vector comprising a polynucleotide of the invention, such as an expression vector in which the 
so polynucleotide is operably linked to a promoter; 

a transformant which is transformed with the above vector, suitably a procaryote or a eucaryote that is not a 
human-being; 

a method for preparing a p-carotene hydroxylase and/or p-cryploxanthin. which method comprises maintaining 
a transformant as defined above under conditions such that the desired p-carotene hydroxylase and/or p-cryp- 
S5 toxanthin is expressed and recovering the p-carotene hydroxylase and/or p-cryptoxanthin; 

a method for preparing a p-carotone hydroxylase comprising culturing the above transformant in a medium and 
recovering the p-carotene hydroxylase from the resultant culture; and 

a method for preparing p-cryptoxanthin comprising culturing the above transformant in a medium and recover- 
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ing p-cryptoxanthin from the resultarrt culture. 

[001 2] In the accompanying drawings: 

s Fig. 1 is a diagram showing a biosynthetic pathway of carotenoids. 

Fig. 2 presents chromatograms showing the results of high performance liquid chromatography. 

Fig. 3 is a diagram showing comparison of homology in amino acid sequences between the p-carotene hydroxylase 

of the invention and other enzymes. 

10 [0013] A polypeptide of the invention consists essentially of the amino acid sequence set out in SEQ ID NO: 2 or a 
substantially homologous sequence, or of a fragment of either of these sequences. The polypeptide may consist of 
such a sequence. In general, the naturally occurring sequence shown in SEQ ID NO: 2 is preferred. However, the 
polypeptides of the invention include homologues of the natural sequence, and fragments of the natural sequence and 
of Its homologues, which are capaljle of functioning as a p-carotene hydroxylase. The p-carolene hydroxylase catalyzes 

IS the reaction indicated by arrow (2) in Fig. 1. It typically does not catalyze the reaction indicated by arrow (1). 

[0014] An amino acid sequence at least 70% homologous to the amino acid sequence of SEQ ID NO: 2 will be pref- 
erably at least 80 or 90% and more preferably at least 95. 97 or 99% homologous thereto. Homology is calculated on 
the basis of amino acid identity. SEQ ID NO: 2 can thus be modified, typically by deletion, substitution and/or addition. 
One or several, for example up to five, amino acids of SEQ ID NO: 2 may be modified in this way. In fact more may be 

20 modified in this way within the scope of the invention. 

[0015] Amino acid substitutions may be made, for example from 1 , 2 or 3 to 10, 20 or 30 substitutions. The modified 
polypeptide retains p-carotene hydroxylase activity. Conservative substitutions may be made, for example according to 
the following Table. Amino acids in the same block in the second column and preferably in the same line in the third col- 
umn may be substituted for each other: 

25 



35 
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40 [0016] One or more amino acid residues of SEQ ID NO: 2 may alternatively or additionally be deleted. From 1 . 2 or 
3 to 10. 20 or 30 residues may be deleted, or more. Polypeptides of the invention also include fragments of the above- 
mentioned sequences (i) and (il). Such fragments retain p-carotene hydroxylase activity Fragments may be at least 
from 10. 12. 15 or 20 to 60, 100 or 200 amino acids in length. 

[0017] One or more amino acids may be alternatively or additionally added to SEQ ID NO: 2. An extension may be 
45 provided at the N-terminus or C-terminus of the sequence of SEQ ID NO: 2. The or each extension may be quite short, 
for example from 1 to 10 amino acids in length. Alternatively, the extension may be longer. A carrier protein may be 
fused to an amino acid sequence accordihg to the invention. A fusbn protein Incorporating the sequence (i). (ij) or (iii) 
above can thus be provided. 

[0018] A polypeptide of the invention may thus have been modified for example by the addition of Histidine residues 
so or a T7 tag to assist in identification or purification of the polypeptide. A signal sequence may have been added to pro- 
mote secretion of the polypeptide from a cell. 

[0019] Polynucleotides of the invention may be single-stranded or double-stranded. They may be a DNA such as a 
cDNA. or an RNA. They thus consist essentially of DNA or RNA encoding a polypeptide of the invention such as the 
protein having the amino acid sequence of SEQ ID NO: 2. A DNA of the invention can consist essentially of the coding 
55 sequence of SEQ ID NO: 1 or of a modified form of that sequence, in particular a modified form that is substantially 
homologous to the coding sequence of SEQ ID NO: 1. 

[0020] A DNA sequence of the invention may therefore be at least 80 or 90%, and more preferably at least 95%. 
homologous to the coding sequence of SEQ ID NO: 1 . There may be as few as from 1 to 30, for example from 5 to 20, 
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nucleotide differences. A DNA sequence may code for the amino acid sequence of SEQ ID NO: 2 but one or more 
codons may be different from corresponding codons of the coding sequence of SEQ ID NO: 1 due the degeneracy of 
the genetic code. 

[0021 1 A polynucleotide of the invention Is typically capable of hybridising selectively with the sequence shown in SEQ 
5 ID NO: 1 which is complementary to the coding sequence of SEQ ID NO: 1. A polynucleotide of the invention and the 
complementary sequence of SEQ ID NO: 1 can hybridize at a level significantly above background. Background hybrid- 
ization may occur, for example, because of other cDNAs present in a cDNA library. The signal level generated by the 
Interaction between a polynucleotide of the invention and the sequence shown in SEQ ID NO: 1 which Is complemen- 
tary to the coding sequence is typically at least 10 fold, preferably at least 100 fold, as intense as interactions t>etween 
10 other polynucleotides and that complementary sequence. The Intensity of interaction may be measured, for example, 
by radiolabelling the probe, e.g. with ^^R Selective hybridisation is typically achieved using conditions of medium to high 
stringency (for example 0.03M sodium chloride and 0.03M sodium citrate at from about 50"C to about 60"C). 
[0022] Polynucleotides of the Invention may be polynucleotides which include within them synthetic or modified nucle- 
otides. A number of different types of modification to polynucleotides are known In the art. These include methylphos- 
15 phonate and phosphorothioate backbones, addition of acridine or polylysine chains at the 3' and/or 5' ends of the 
molecule. For the purposes of the present invention, it is to be understood that the polynucleotides described herein 
may be modified by any method available in the art. Such modifications may be carried out in order to enhance the in 
vivo activity or lifespan of polynucleotides of the invention. 

[0023] Polynucleotides according to the invention may be produced recomblnantly. synthetically, or by any means 
20 available to those of skill in the art. They may also be cloned by standard techniques. The polynucleotides are typically 
provided in isolated and/or purified form. 

[0024] A cDNA of the invention can be isolated by the following procedures. Briefly, a primer Is designed based on a 
conserved region of a gene coding for a bacterium-derived p-carotene hyroxylase. Then. 3' RACE RT-PCR is performed 
using the above primer and, as a template, a first strand cDNA from the fruit (juice sacs) and flower of Citrus unshiu 
25 (variety: Miyagawa early) to obtain a cDNA fragment of the Citrus unshiu p-carotene hydroxylase. Subsequently, using 
this cDNA fragment as a probe, the p-carotene hydroxylase of interest can be isolated from a cDNA library derived from 
the edible part of Citrus unshiu . 

1 . Cloning of a DNA Coding for the p-Carotene Hydroxylase (1) Preparation of primers 

30 

[0025] First, primers for use In the 3' RACE RT-PCR to be described later are prepared. In order to design a primer 
that Is more specific to a DNA of interest. It is appropriate to prepare an oligonucleotide coding for a region in which 
amino acid residues are highly conserved among various bacteria and plants. Such a primer can be prepared by con- 
ventional chemical synthesis. For example, the following amino acid sequences may be selected as regions satisfying 
35 the above-mentioned condition: 

I) rPhe Glu Leu Asn Asp Val Phe AlaJ (SEQ ID NO: 3) 
ii) [His Asp Gly Leu Val HisJ (SEQ ID NO: 4) 

40 [0026] Since these two regions with highly conserved amino acid residues are located close to each other, they can- 
not be used as a sense primer and an antisense primer in a PCR. Thus, In the present invention. 3* RACE RT-PCR 
method was employed in which each of these sequences was used as a sense primer. 

[0027] The above sequences are found within the amino acid sequences for an Arabidopsis -derived and an Erwmia- 
derived p-carotene hydroxylase described by Zairen Sun et al., The Journal of Biological Chemistry, 1 996; Vol. 271 , No. 
45 40: 24349-24352 and Nakagawa M. and N. Misawa, Agric. Biol. Chem. 55:2147-2148. respectively. 

[0028] Based on these amino acid sequences, oligonucleotide primers with the following sequences, for example, are 
prepared. However, the primers are not limited to these sequences. 

Sense 1 primer (Bech-a): TT(t/c)GA(g/a)CTAAA(cA)GA(t/c)GTN (SEQ ID NO: 5) 
so Sense 2 primer (Bech-B): CACGA(crt)GGTCTNGTNCA (SEQ ID NO: 6) 

(2) 3' RACE RT-PCR 

[0029] Subsequently, a 3'RACE RT-PCR is performed using the two sense primers synthesized. RT-PCR (reverse 
55 transcription-PCR) is a method in which a DNA is synthesized (reverse transcribed) with RNA as a tenplate using a 
reverse transcriptase, and thereafter a PCR is performed using the synthesized DNA as a template. 3' RACE (rapid 
amplification of cDNA ends) is a method in which an RT-PCR is performed based on a nucleotide sequence of a known 
region to thereby clone the unknown region of a cDNA of interest up to the relevant cDNA end. 
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[0030] First, a reverse transcription is performed using an oligo(dT) primer having an adaptor sequence at its 5' end 
to thereby synthesize a first strand cDNA. All of the resultant first strand cDNA molecules have a structure in which the 
adaptor sequence is attached to the end. Therefore, in the cDNA to be cloned, the unknown region is located between 
the known sequence and the adaptor sequence. Then, the unknown region (cDNA partial sequence) sandwiched 
5 between the two sequences can be amplified by performing a PGR using a part of the known sequence as a sense 
primer together with the adaptor primer, 

[0031 ] An RT-PCR can be performed using a commercial kit (T-Primed First-Strand Kit: Pharmacia). 

(3) Preparation of a cDNA Library 

10 

[0032] In order to obtain the full-length cDNA of interest from a fruit-derived cDNA library using the cDNA partial 
sequence obtained above as a probe, the library Is prepared as described t^low. 

[0033] Total RNA is isolated from individual citrus organs or tissues (fruit, leaf. root, flower, callus, etc.) using a gua- 
nidine reagent or SDS-phenol. Then. mRNA is prepared from the total RNA by the affinity column method using oligo 

15 dT-cellulose or poly U-Sepharose carried on Sepharose 2B or by a method using an oligotex resin. Using the resultant 
mRNA as a template, a single-stranded cDNA is synthesized with a reverse transcriptase. Thereafter, a double- 
stranded cDNA is synthesized from the single-stranded cDNA, The resultant double-stranded cDNA is ligated to an 
appropriate plasmid or phage vector using a ligase to thereby obtain a recombinant DNA. By infecting or transforming 
Escherichia coli or the like with this recombinant DNA, a cDNA library capable of screening by plaque or colony hybrid- 

20 ization can be obtained. 

(4) Isolation of a p-Carotene Hydroxylase cDNA Homologue from the cDNA Library 

[0034] Subsequently, screening for the full-length cDNA sequence is performed by plaque or colony hybridization 
25 using the cDNA sequence isolated by the 3' RACE RT-PCR described above as a probe. For this hybridization, a com- * 
mercial kit such as ECL Nucleic Acid Labelling and Detection System (Amersham) may be used. 

(5) Determination of the Nucleotide Sequence 

— '30- -[0035]~The"nucleotide"sequence of th^btained clone is determinedTThis can be performed by conventional methods 
such as Maxam-Gilbert method, the dideoxy method or the like. Usually, the determination is carried out with an auto- 
matic DNA sequencer. 

r [0036] SEQ ID NO: 1 shows the nucleotide sequence for the DNA of the invention and SEQ ID NO: 2 shows the amino 

acid sequence for the p-carotene hydroxylase of the invention. However, as long as a protein consisting of this amino 

35 acid sequence has p-carotene hydroxylase activity, the sequence may have some mutation such as deletion, substitu- 
tion or addition of one or several amino acids. For example, a protein consisting of the amino acid sequence of SEQ ID 
NO: 2 in which Met at the first position has been deleted is also included in the protein of the invention. 
[0037] The p-carotene hydroxylase activity In the present invention means an activity to perform a catalytic reaction 
producing p-cryptoxanthin from p-carotene. 

40 [0038] Once the nucleotide sequence for the DNA of the invention has been established, the DNA of the Invention 
can be obtained by chemical synthesis or by hybridization using a DNA fragment having a part of the sequence as a 
probe. 

2. Preparation of a Recombinant Vector and a Transformant 
(1) Preparation of a Recombinant Vector 

[0039] The reconnblnant vector of the invention can be obtained by ligating (inserting) the DNA of the invention to (into) 
an appropriate vector. The vector into which the DNA of the invention is to be inserted is not particularly limited as long 

so as it is replicable in a host. For example, a plasmid DNA, a phage DNA or the like may be used. 

[0040] A plasmid DNA can be prepared from E, coN or Aorobacterium by alkali extraction (Birnboim. H.C. & Doly. J. 
(1979), Nucleic Ackj Res.. 7:1513) or variations thereof. Alternatively, a commercial plasmid such as pBluescript II SK+ 
(Stratagene), pUCl18 (TaKaRa). pUCl19 (TaKaRa). pGEM-T (Promega) or the like may be used It is preferred that 
these plasmids contain a selectable marke- such as ampicillin resistance gene, kanamycin resistance gene or chloram- 

55 phenicol resistance gene. 

[0041] As a phage DNA. Ml3mp18, M13mp19 or the like may be given. 

[0042] For insertion of the DNA of the invention into a vector, a method may be employed in which the purified DNA 
is digested with an appropriate restrknion enzyme and then inserted into tiie relevant restriction site or the multi-cloning 
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site of the vector for ligation. The DNA of the invention should be incorporated in the vector in such a manner that the 
function thereof is operable. For this purpose, the vector of the invention may contain a terminator, ribosome binding 
sequence or the like in addition to a promoter and the DNA of the invention. 

s (2) Preparation of a Transformant 

[0043] The transformant of the invention can be obtained by introducing the recombinant vector of the invention into 
a host so that the gene of interest can be expressed. 

[0044] The host is not particularly limited as long as it can express the DNA of the invention. Specific examples of the 
10 host include Escherichia or Bacillus bacteria such as E coli and Bacillus subtilis : yeasts such as Saccharomvces cer- 
evisiae : or animal cells such as COS cells and CHO cells. 

[0045] When a bacterium such as E, con is used as the host, preferably, the recombinant vector of the Invention is 
capable of autonomous replication in the host and, at the same time, is constituted by a promoter, a ribosome binding 
sequence, the DNA of the invention and a transcription termination sequence. The vector may also contain a gene to 
15 control the promoter. 

[0046] As the expression vector. pBluescript II vector (Stratagene). pET vector (Stratagene) or the like may be used. 
[0047] As the promoter, any promoter may be use as long as it can direct the expression of the DNA of the invention 
in the host such as E coli . 

[0048] For example, an E coN-derived or phage-derived promoter such as trp promoter, lac promoter, Pl promoter or 

20 pR promoter may be used. 

[0049] As a method for introducing the recombinant vector into the bacterium, any method of DNA introduction into 
bacteria may be used. For example, a method using calcium ions (Proc. Natl. Acad. Sci.. USA, 69:2110-2114 (1972)) 
may be used. When a yeast is used as the host. YEp13, YEp24, YCp50 or the like is used as an expression vector. As 
a promoter used in this case, any promoter may be used as long as it can direct the expression of the DNA of the inven- 

25 tion in yeasts. For example, gall promoter, gal 10 promoter, heat shock protein promoter. MF a1 promoter or the like 
may be enumerated. 

[0050] As a method for introducing the recombinant vector into the yeast, any method of DNA introduction into yeasts 
may be used. For example, electroporation (Methods EnzymoL. 194:182-187 (1990)). the spheroplast method (Proc. 
Natl. Acad. So., USA, 84:1929-1933 (1978)). the lithium acetate method (J. BacterioL, 153:163-168 (1983)) or the like 
30 may be enumerated. 

[0051] When an animal cell is used as the host. pcDNAI/Amp (Invitrogen) or the like is used as an expression vector. 
In this case, the early gene promoter of human cytomegalovirus or the like may be used as a promoter. 
[0052] As a method for introducing the recombinant vector into the animal cell, electroporation. the calcium phosphate 
method, lipofection or the like may be enumerated. 
35 [0053] The recombinant vector of the invention incorporated in E £o!i (designation: EpCitBECHI) was deposited on 
1 December 1997 at the National Institute of Bioscience and Human-Technology Agency of Industrial Science and 
Technology (1-3, Higashi 1-Chome. Tsukuba City, Ibaraki Pref., Japan) as PERM BP-6188 under the Budapest Treaty 
The deposit was made in the name of Toru Maotani; Director General of National Institute of Fruit Tree Science, Ministry 
of Agriculture. Forestry and Rsheries; 2-1, Fujimoto; Tsukuba-shi; Ibaraki 305; Japan. 

40 

3. Production of the p-Carotene Hydroxylase 

[0054] The p-carotene hydroxylase of the invention can be obtained by culturing the transformant described above 
and recovering the p-carotene hydroxylase from the resultant culture. 
45 [0055] The cultivation of the transformant of the invention in a medium is carried out by conventional methods used 
for culturing a host. 

[0056] As a medium to culture the transformant obtained from a microorganism host such as E. CQli or yeast, either 
a natural or a synthetic medium may be used as long as it contains carbon sources, nitrogen sources and inorganic salt 
sources assimilable by the microorganism and can be used for effective cultivation of the transformant 
50 [0057] As carbon sources, carbohydrates such as glucose, fructose, sucrose, starch; organic adds such as acetic 
acid, propionic acid; and alcohols such as ethanot and propanol may be used. 

[0058] As nitrogen sources, ammonia; ammonium salts of inorganic or organic acids such as ammonium chloride, 
ammonium sulfate, ammonium acetate, ammonium phosphate; other nitrogen-containing compounds; Peptone; meat 
extract; corn steep liquor and the like may be used. 
55 [0059] As inorganic substances, potassium dihydrogen phosphate, dipotassium hydrogen phosphate, magnesium 
phosphate, magnesium sulfate, sodium chloride, iron(ll) sulfate, manganese sulfate, copper sulfate, calcium carbonate 
and the like may be used. 

[0060] Usually, the cultivation is carried out under aerobic conditions (such as shaking culture or aeration agitation 
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culture) at 28 *C for 48 to 60 hrs. During the cultivation, the pH is maintained at 7.0 to 7.5. TTie pH adjustment is carried 
out using an inorganic or organic salt, an alkali solution or the like. When an E. coli transformant is cultured, it is prefer- 
able to allow pACCARl6AcrtX plasmid (having 4 ErwiQia-derlved genes that can produce carotenoids from farnesyl 
diphosphate to p-carotene) to coexist In the E coli . 

£ [0061 ] During the cultivation, an antibiotic such as ampicillin or tetracycline may be added to the medium if necessary 
[0062] When a microorganism transformed with an expression vector using an inducible promoter is cultured, an 
inducer may be added to the medium if necessary For example, when a microorganism transformed with an expression 
vector using Lac promoter is cultured. isopropyl-p-D-thiogalactopyranoside (IPTG) or the like may be added. When a 
microorganism transformed with an expression vector using trp promoter is cultured, indoleacetic acid (lAA) or the like 

JO may be added. 

[0063] As a medium to culture a transformant obtained from an animal cell as a host, commonly used RPMM640 
medium or DMEM medium, or one of these media supplemented with fetal bovine serum, etc. may be used. 
[0064] Usually the cultivation is carried out in the presence 5% CO2 at 37 *»C for 1 to 2 days. 

[0065] During the cultivation, an antibiotic such as kanamycin or penicillin may be added to the medium if necessary. 

15 [0066] After the cultivation, the p-carotene hydroxylase of the invention is recovered by disrupting the microorganisms 
or cells if the enzyme is produced in the microorganisms or cells. If the p-carotene hydroxylase of the invention is pro- 
duced outside of the microorganisms or cells, the culture fluid (as it is or after centrifugation to renrtove the microorgan- 
isms or cells) is subjected to conventional biochemical techniques used for isolating/purlfyinng a protein. These 
techniques include ammonium sulfate precipitation, gel chromatography, ion exchange chromatography and affinity 

20 chromatography These techniques may be used independently or in an appropriate combination to isolate and purify 
the p-carotene hydroxylase of the invention from the culture. 

[0067] The confirmation that the finally obtained protein is a p-carotene hydroxylase can be made by SDS-polyacry- 
lamide gel electrophoresis. 

25 4. Production of p-Cryptoxanthin 

. [0068] In the present invention, it is also possible to produce p-cryptoxanthin in the same manner as desaibed in the 

purification of the p-carotene hydroxylase. Briefly, the transformant described above is cultured in a medium and then 
p-cryptoxanthin is extracted from the resultant culture. The method of cultivation is the same as described in "3. Pro- 

30 — duction of the p-Garotene Hydroxylase": ^ ~~ 

[0069] After the cultivation, the microorganisms or cells are removed from the culture by centrifugation of the like. 
Then, p-cryptoxanthin can be extracted from the culture by HPLC or the like. 

i [0070] The confirmation that the finally extracted substance is p-cryptoxanthin can be made by ^ H-NMR, ultraviolet- 

visible spectroscopy mass spectrometry, etc. 
35 [0071 ] Now, the present invention will be described more specifically below with reference to the following Exanples. 
which should not be instrued as limiting the technical scope of the invention. 

EXAMPLE 1 

40 Cloning of a cDNA Coding for the p-Carotene Hydroxylase 

(1) Qoning of a Partial cDNA of Interest Using 3' RACE RT-PCR 

[0072] A 1st-strand cDNA was prepared by performing a reverse transcription using Notl-D(T),8 

45 (5'd[AACTGGAAGAATTCGCGGGCGCAGGAATi8]3')(SEQ ID NO: 7) as a primer and RNA from the fruit (juice sacs) 
and flower of Cilrys gnshiyj (variety: Miyagawa early) as a template. At the time of this synthesis. NotI adaptor sequence 
(TGGAAGAATTCGCGGCCGCAG) (SEQ ID NO: 8) was added at the 3* end of every Ist-strand cDNA fragment. Using 
this 1st-strand cDNA as a template, a PCR was performed with Sense 1 primer and the adaptor primer. The reaction 
was carried out 35 cycles. 1 cycle consisting of denaturation at 94.5°C for 40 sec and annealing/extension at 60 **C for 
so 2 min. In this first stage PCR. however, the adaptor primer used is the sequence contained commonly in all of the cDNA 
fragments generated by the reverse transcription reaction. Therefore, the PCR product obtained at this stage contains 
a large number of non-specif ically amplified DNA fragments. In order to amplify the DNA of interest specifically, the sec- 
ond stage PCR was performed using Sense 2 primer. This reaction was carried out 35 cycles, one cycle consisting of 
denaturation at 94.5^C for 35 sec, annealing at 55°C for 45 sec and extension at 72 **C for 1 min. For the RT-PCR. a 
55 commercial kit (TPrimed First-Strand Kit: Pharmacia) was used. 

[0073] By the above procedures, a cDNA partial sequence coding for a citrus p-carotene hydroxylase was obtained. 
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(2) Preparation of a cDNA Library from a Citrus Fruit Tissue 

[0074] Total RNA was isolated from the fruit (juice sac tissue) of Citrus unshiu (variety: Miyagawa early) using guani- 
dine thiocyanate. After the isolated total RNA was purified into mRNA using Oligotex-dT30 [Super] (TaKaRa). a first- 
5 strand cDNA was synthesized using an oligo(dT) 12-13 primer and a reverse transcriptase from Moloney murine leuke- 
mia virus (MMLV). Further, a second-strand cDNA was synthesized using a DNA polymerase (Pharmacia). To the 
resultant double-stranded cDNA. EcoRl adaptor was added by T4 DNA ligase. followed by ligation to Uni-2AP EcoRI 
phagemid vector (Stratagene). 

10 (3) Screening of the Full-Length cDNA of Interest by Plaque Hybridization 

[0075] Subsequently, the full-length cDNA sequence of interest was screened by plaque hybridization using the cDNA 
partial sequence coding for a citrus p-carotene hydroxylase obtained by the 3' RACE RT-PCR described above. 
[0076] A commercial kit (ECL nucleic add labelling and detection system: Amersham) was used for the hybridization. 

15 As a result of the screening (3x10"* pfc). a p-carotene hydroxylase cDNA homologue of 11 58 bp in full length was iso- 
lated which was coding for a peptide of 31 1 amino acid residues with an estimated molecular weight of 34.7 kDa. This 
clone exhibited 76.3% homology to an Arabidopsis -derived p-carotene hydroxylase cDNA which produces zeaxanthin 
from p-carotene, and 35.7 to 39.8% homology to bacteria-derived p-carotene hydroxylase genes which produce zeax- 
anthin. This clone was designated "CitBECHr. The nucleotide sequence for CitBECHI is shown in SEQ ID NO: 1 , and 

20 the amino acid sequence encoded by CitBECHI is shown in SEQ ID NO: 2. 

[OOTTH The results of comparison of homology between conventional p-carotene hydroxylases and the p-carotene 
hydroxylase of the invention are shown in Fig. 3. 

[0078] In Fig. 3. shown at the top row (CitBECHI) is the p-carotene hydroxylase amino acid sequence encoded by 
the gene of the invention. The others are amino acid sequences encoded by related genes; any of these sequences is 
25 a sequence for a gene producing zeaxanthin from p-carotene skipping over p-cryptoxanthin. 

EXAMPLE 2 

Production of p-Cryptoxanthin in Escherichia coli Having the p-Carotene Hydroxylase 

30 

(1) Expression of the DNA of the Present Invention 

[0079] The isolated clone was inserted into pBluescript II SK-i- plasmid having an ampicillin resistance gene. The 
resultant plasmid was introduced Into Escherichia cqU. in which pACCAR16AcrtX plasmid (having 4 Erwinia-derived 
35 genes that can produce from farnesyl diphosphate to p-carotene) was allowed to coexist. The resultant cqU was cul- 
tured in LB medium at 28*'C for 60 hrs. 

[0080] Then, the culture was subjected to acetone extraction. The acetone extract from the transformant was sub- 
jected to HPLC using a system manufactured by Japan Spectroscopic Co.. Ltd. As a column, a C30 column manufac- 
tured by YMC was used. As eluent A, a mixture of methanol/methyl-t-butyl ether/water mixed at a ratio of 81/15/4 was 
40 used. As eluent B, a mixture of methanol/methyl-t-butyl ether mixed at a ratio of 10/90 was used. Gradient conditions 
were as follows: eluent A 1 00% at the time of start; eluent A 20% and eluent B 80% 70 min after the start. The flow rate 
was 1 .0 ml/min and the column temperature 22^*0. The detection wave length was 450 nm. 

[0081] As a result, the chromatograms shown In Fig. 2 were obtained. When the resultant peaks were conrpared with 
the peaks of the carotenokj standard products manufactured by Funakoshl. it was found that the B coli produced p- 
45 cryptoxanthin, p-carotene and zeaxanthin at a ratio of 43:22:1 1 . From this result, it was judged that the citrus-derived 
p-carotene hydroxylase mainly produces p-cryptoxanthin. 

(2) Production and Identification of p- Cryptoxanthin 

so [0082] Plasmid pCitBECH 1-introduced. g-carotene-producino E. coli JM101 [E. coli (PACCARISAcrtX. pCitBECH 1)] 
(presenting a yellow color) was cultured in 1 .6 L of 2xYT medium [1 .6% tryptone, 1% yeast extract. 0.5% NaCI] contain- 
ing 150 pg/ml of ampicillin (Ap) and 30 ng/ml of chloramphenicol (Cm) at 30**C for 28 hrs. Cells were harvested from 
the culture fluid were subjected to extraction with 360 ml of acetone. The resultant extract was concentrated and 
extracted with 200 ml of chlorofbrm/methanol (9/1) twice, followed by concentration and drying. The resultant solid 

55 material was dissolved in a small amount of chloroform/methanol (9/1 ) and then subjected to thin layer chromatography 
(TLC) in which the sample was developed with chloroform/methanol on a silica gel preparative TLC plate from Merck. 
[0083] As a result of this TLC, the initial pigments were divided into two spots of Rf values 0.4 (dark) and 0.1 (very 
light), respectively, in addition to the p-carotene spot at the top. Then, the dark yellow pigment of Rf 0.4 was scratched 
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off from the TLC plate, dissolved in a small amount of chloroform/methanol (1/1) and subjected to TOYOPEARL HW-40 
column chromatography for development and elution. 
[0084] . As a result. 1 mg of the pure pigment was obtained. 

[0085] This pigment was considered to be p-cryptoxanthin from the results of examination of the ultraviolet-visible 
5 spectrum (7. 425. 448. 475 nm in methanol) and the FD-MS spectrum (m/e 553. [M]+). Further, two signals of 3-hydroxy- 
p-ionone ring and p-ionone ring (G. Englert, N.M.R. of Carotenoids edited by G. Britton. T.W. Goodwin, Carotenoid 
Chemistry and Biochemistry) were confirmed from its ^H-NMR spectrum. 

[0086] Consequently, this pigment was identified as p-cryptoxanthIn (Fig. 2). Fig. 2 indicates that the gene of the 
invention Is involved in biosynthesis of p-cryptoxanthin. In Fig. 2, the top panel shows the results of HPLC analysis of 
10 the carotenoids produced by cQli in which an Ejwiia-derived p-carotene biosynthesis gene was incorporated; the 
middle panel shows the results of HPLC analysis of the carotenoids produced by the above E. coli in which the gene of 
the invention was further incorporated; and the bottom panel shows the results of HPLC analysis of zeaxanthin. p-cryp- 
toxanthin and p-carotene standard products. 

[0087] From Fig. 2. It can be seen that, different from conventional p-carotene hydroxylases encoded by known genes 
15 (Crt Z) derived from Erwinia and marine bacteria, the p-carotene hydroxylase encoded by the gene of the invention cat- 
alyzes synthesis of carotenoids in such a manner that p-cryptoxanthin is produced mainly and zeaxanthin is produced 
in only a small amount (Fig. 2. middle panel). 

[0088] According to the present invention, a p-carotene hydroxylase, a DNA coding for the p-carotene hydroxylase, a 
recombinant vector comprising the DNA, a transformant transformed with the vector, a method for preparing the p-car- 
20 otene hydroxylase and a method for preparing p-cryptoxanthin are provided. 

[0089] The p-carotene hydroxylase of the invention is useful in catalyzing synthesis of p-cryptoxanthin. a pigment nec- 
essary and important for maintaining the quality and function of citrus fruits and processed products thereof. 
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SEQUENCE LISTING 

SijQ ID NO: 1 
SKQUENCE LENGTH: 1158 
SKQUENCE TYPE: nucleic acid 
STHANDEDNESS : double 
TOPOLOGY: linear 
,5 MOLECULE TYPE; CDNA to mRNA 

ORIGINAL SOURCE 

ORGANISM: Ci trus unsh iu 
20 FEATURE 

NAME /KEY: CDS 
LOCATION: 87.. 1019 
SEQUENCE DESCRIPTION 

CCACAATCCA CTTCACATCA ACTCTTCCTC TTTTCAAGTG CTTTTACTCT AAAACCCAAA 60 
ACCTCGTAAA CAAACAAAAC CCCACC ATG GCG GTC GGA CTA TTG GCC GCC ATA 113 

Met Ala Val Gly Leu Leu Ala Ala lie 

1 5 

GTC CCG AAG CCC TTC TGT CTC CTC ACA ACA AAA CTT CAA CCC TCT TCG 161 
Val Pro Lys Pro Phe Cys Leu Leu Thr Thr Lys Leu Gin Pro Ser Ser 
10 15 20 25 

CTC CTC ACA ACA AAA CCC OCT CCC CTT TTT GCC CCT CTC GGT ACC CAC 209 
Leu Leu Thr Thr Lys Pro Ala Pro Leu Phe Ala Pro Leu Gly Thr His 
30 35 40 

« CAT GGC TTC TTT AAT GGC AAA AAC CGA AGA AAA CTC AAC TCT TTC ACC 257 

His Gly Phe Phe Asn Gly Lys Asn Arg Arg Lys Leu Asn Ser Phe Thr 
45 50 55 

^ GTA TGT TTT GTT TTA GAG GAG AAA AAA CAA AGC ACC CAG ATC GAG ACT 305 

Val Cys Phe Val Leu Glu Glu Lys Lys Gin Ser Thr Gin lie Glu Thr 

55 



25 



30 



35 



40 



10 
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60 65 70 
TTC ACG GAC GAG GAG GAG GAG GAG TCG GGT ACC CAG ATC TCG ACT GCT 353 

Phc Thr Asp Glu Glu G.lu Glu GJu Sc?r G.ly Thr GJn I Jo Ser Thr A}a 

75 80 «5 

GCC CGC GTG GCC GAG AAA TTG GCG AGA AAG AGA TCC GAG AGG TTC ACT 4 01 

Ala Arg Val Ala Glu Lys Leu Ala Arg Lys Arg Ser Glu Arg Phe Thr 

90 95 100 lOS 

TAT CTC GTT GCT GCC GTC ATG TCT AGT TTT GGT ATC ACT TCC ATG GCT 449 

Tyr Leu Val Ala Ala Val Met Ser Ser Phe Gly He Thr Ser Met Ala 

110 115 120 

so GTC ATG GCT GTT TAT TAC AGG TTC TGG TGG CAA ATG GAG GGT GGA GAG 497 

Val Met Ala Val Tyr Tyr Arg Phe Trp Trp Gin Met Glu Gly Gly Glu 

125 130 135 

25 GTG OCT TTA GCT GAA ATG TTT GGC ACA TTT GCT CTC TCT GTT GGT GCT 545 

Val Pro Leu Ala Glu Met Phe Gly Thr Phe Ala Leu Ser Val Gly Ala 

140 145 150 



15 



-30 



35 



40 



45 



GCT GTG GGC ATG GAG TTT TGG GCA CGA TGG GCT CAT AAA GCT CTG TGG 593 
Ala Val Gly Met Glu Phe Trp Ala Arg Trp A3 a His Lys Ala Leu Trp 

155 160 165 

CAT GCT TCT TTA TGG CAT ATG CAC GAG TCT CAC CAT CGA CCA AGA GAG 641 
His Ala Scr Leu Trp His Met His Glu Ser His His Arg Pro Arg Glu 
170 175 180 185 

GGT CCT TTT GAG CTA AAC GAT GTG TTT GCC ATA ATC AAC GCA GTT CCA 689 
Gly Pro Phe Glu Leu Asn Asp Val Phe Ala He He Asn Ala Val Pro 

190 195 200 

GCC ATA GCC CTT CTC TCT TTT GGC TTC TTC CAC AAA GGC CTT GTA CCT 737 
Ala He Ala Leu Leu Ser Phe Gly Phe Phe His Lys Gly Leu Val Pro 
so 205 210 215 

GGT CTC TGC TTT GGT GCT GGA CTT GGC ATT ACG GTG TTT GGG ATG GCC 785 



55 
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GJy Lou Cys Pho GJy Ala Gly l.eu Gly lie Thr Val Pho Gly Mel Alcj 

220 225 230 

TAG ATG TTC GTC CAC GAT GGT CTC GT'I" CAC AAA AGG TTC OCT G'l'G GGT 63^ 
Tyr Met Pho Val His Asp Gly Leu Val His Lys Arg Phe Pro Val Gly 

235 240 245 

CCC Arr GCC GAC GTG CCT tat TTC CGG AGA GTC GCT GCG GOT CAC CAG 881 
t»ro He Ala Asp Val Pro Tyr Phe Arg Arg Val Ala Ala Ala His Gin 
15 250 255 260 265 

CTT CAC CAC TCG GAT AAA TTC CAC GGT GTT CCA TAT GGG CTC TTT CTC 929 
Leu His His Ser Asp Lys Phe His Gly Val Pro Tyr Gly Leu Phe Leu 
^ 270 275 280 

GGA CCT A AG GAG CTT GAA GAA GTG GGG GGA CTA GAA GAA TTG GAG A AG 977 
Gly Pro Lys Glu Leu Glu Glu Val Gly Gly Leu Glu Glu Leu Glu Lys 

285 290 295 

GAG ATC AGT AAG AGA ATC AAA TCA TAC AAC AGG GTT CCA AAA 1019 
Glu lie Ser Lys Arg He Lys Ser Tyr Asn Arg Val Pro Lys 

300 305 310 

TAATCAATTT AATGGGAGGA CCAATTTTTG GATCAATTTG TCAGTGTACA GAAACAATAG 1079 
TGTTATTAAT GAAAAAAATA AATTATGAAT GCTTATGGGT GGATTACTGT TGTAAAGTTT 113 9 
ATGATGTTAA ATAATATAT 1158 



25 



30 



35 



40 SEQ ID NO: 2 

SEQOENCE LENGTH: 311 
SEQUENCE TYPE: amino acid 
TOPOLOGY: linear 
MOLECULE TYPE: protein 
SEQUENCE DESCRIPTION 

SO 

Met Ala Val Gly Leu Leu Ala Ala He Val Pro Lys Pro Phe Cys Leu 
15 10 15 

55 
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Leu Thr Thr Lys Leu Gin Pro Sc:: Scr Lou Leu Thi Thr Lys Fro Aia 

20 25 30 

Pro Leu Phe Ala Pro Leu Gly Thr His His Giy Phe Phe Asn Gly Lys 

35 40 45 

Asn Arg Arg I.ys Leu Asn Ser Phe Thr VaJ Cys Phe VaJ Leu Glu GJu 

50 55 60 

Lys Lys GJn Ser Thr Gin lie Glu Thr Phe Thr Asp Glu Glu Glu Glu 
15 65 70 75 80 

Glu ser Gly Thr Gin He Ser Thr Ala Ala Arq Val Ala Glu Lys Leu 

85 90 95 

Ala Arg Lys Arg Ser Glu Arg Phe Thr Tyr Leu Val Ala Ala Val Met 

100 105 110 

Ser Ser Phe Gly lie Thr Ser Met Ala Val Met Ala Val Tyr Tyr Arg 

115 120 125 

Phe Trp Trp Gin Met Glu Gly Gly Glu Val Pro Leu Ala Glu Met Phe 
130 135 140 



20 



25 



35 



40 



Gly Thr Phe Ala Leu Ser Val Gly Ala Ala Val Gly Met Glu Phe Trp 
145 150 155 160 

Ala Arg Trp Ala His Lys Ala Leu Trp His Ala Ser Leu Trp His Met 

165 170 175 

His Glu Ser His His Arg Pro Arg Glu Gly Pro Phe Glu Leu Asn Asp 

380 185 190 

Val Phe Ala lie lie Asn Ala Val Pro Ala He Ala Leu Leu Ser Phe 
195 200 205 

45 Gly Phe Phe His Lys Gly Leu Val Pro Gly Leu Cys Phe Gly Ala Gly 

210 215 220 

Leu Gly He Thr Val Phe Gly Met Ala Tyr Met Phe Val His Asp Gly 
^ 225 230 235 240 

r^u Val His Lys Arg Phe Pro Val Gly Pro He Ala Asp Val Pro Tyr 



55 
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15 



20 



25 



30 



35 



45 



SO 



55 



245 250 255 

Phe Arg Arg Va I Ala Ala Ala His Gin L<^u His His Ser Asp Lys Phc 

2G0 265 270 

His Gly Val Pro Tyr GJy Leu Phc Leu Gly Pro Lys Glu Leu Glu Glu 

275 280 285 

VaJ Gly Gly I.eu Glu GJu Leu Glu Lys Glu He Ser I.ys Arg lie Lys 

290 295 300 

Ser Tyr Asn Arg Val Pro Lys 
305 310 

SEQ ID NO: 3 
SEQUENCE LENGTH: 8 
SEQUENCE TYPE: amino acid 
TOPOLOGY: linear 
MOLECULE TYPE: peptide 
SEQUENCE DESCRIPTION 
Phe Glu Leu Asn Asp Val Phe Ala 
1 5 



SEQ ID NO: 4 
SEQUENCE LENGTH: 6 
^ SEQUENCE TYPE: amino acid 

TOPOLOGY: linear 
MOLECULE TYPE: peptide 
SEQUENCE DESCRIPTION 
His Asp Gly Leu Val His 
1 5 



SEQ ID NO: 5 
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SEQUENCE LEN;GTH: 18 



SEQUENCE TYPE: nucleic: acid 



5 



STRANDEDNESS : sin9le 
TOPOLOGY: linear 

MOLECULE TYPE: other nucle.ic acid (synthetic DNA ) 
SEQUENCE DESCRIPTION 
TTYGAKCTAA AYGAYGTN 



SEQ ID NO: 6 
SEQUENCE LENGTH: 17 
SEQUENCE TYPE: nucleic acid 
STRANDEDNESS: single 
TOPOLOGY: linear 

MOLECULE TYPE: other nucleic acid (synthetic DNA) 
SEQUENCE DKSCHIPTION 
CACGAYGGTC TNGTNCA 



SEQ ID NO: 7 
SEQUENCE LENGTH: 4 5 
SEQUENCE TYPE: nucleic acid 
STRANDEDNESS: single 
TOPOLOGY: linear 

MOLECULE TYPE: other nucleic acid (synthetic DNA) 
SEQUENCE DESCRIPTION 

AACTGGAAGA ATTC(3C<3GCC GCAGGPlATTT tTTTTTTTTT TTTTT 45 
SEQ ID NO: 8 



75 



30 



SO 



SEQUENCE LENGTH: 21 



SEQUENCE TYPE: nucleic acid 



55 
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STRANDEDNESS : single 
^ TOPOLOGY: linear 

MOLECULE TYPE: other nucleic acid (synthetic DNA ) 
SEQUENCE DESCRIPTION 

10 

TGGAAGAATT CGCGGCCGCA G 

15 
20 
25 
30 
35 
40 
45 
SO 
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TO 



15 



20 



25 



30 



35 



40 



45 



SO 



SEQUENCE LISTING 

<110> Director General of National InsCitutie of Fruit Tree Science^ 
Ministry of Agriculture^ Forestry and Fisheries; 
BiO'Oriented Technology Research Advancement Institution 

<120> p -CAROTENE HYDROXYLASE GENE 

cl30> N. 75714 GCW 

<140> EP983098S9.1 
<141> 1998-12-02 

<150> iJP97/331936 
<151> 1997-12-02 

<160> 8 

<170> Patentin Ver. 2.0 

c210> X 
<2X1> 1158 
<212> DNA 

<213> Citrus unshiu 

<220> 
<221> CDS 

_c 22 2 >„( 8 7-)- (10 1 9 ) 



<400> 1 

ccacaatcca cttcacatca actcttcctc ttttcaagtg cttttactct aaaacccaaa 60 

acctcgtaaa caaacaaaac cccacc atg gcg gtc gga eta ttg gcc gcc ata 113 

Met TUa Val Gly Leu Leu Ala Ala Xle 
1 5 

gtc ccg aag ccc ttc tgt etc etc aca aca aaa ett caa ccc tct teg 161 
Val Pro Lys Pro Phe Cys Leu Leu Thr Thr Lys Leu Gin Pro Ser Ser 
10 15 20 25 

etc etc aca aca aaa eee get eec ett ttt gcc cet etc ggt ace cac 209 
Leu I^eu Thr Thr Lys Pro Ala Pro Leu Phe Ala Pro Leu Gly Thr His 
30 35 40 

cat gge ttc ttt aat ggc aaa aac ega aga aaa etc aae tct ttc ace 257 
His Gly Phe Phe Asn Gly Lys Asn Arg Arg Lys Leu Asn Ser Phe Thr 
45 50 55 

gta tgt ttt gtt tta gag gag aaa aaa caa age ace cag ate gag act 305 
Val Cys Phe Val Leu Glu Glu Lys Lys Gin Ser Thr Gin lie Glu Thr 
60 65 70 



55 
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25 



45 



SO 



So 



ttc acg gac gag gag gag gag gag teg ggt acc cag ate teg act get 353 
Pile Thr Asp Glu Glu Glu 61u GIu Ser Gly Thr Gin lie Ser Thr Ala 
75 80 85 

gcc cgc gtg gcc gag aaa ttg gcg aga aag aga tec gag agg ttc act 401 
Ala Arg Val Ala Glu Lys Leu Ala Arg Vys Arg Ser Glu Arg Phe Thr 
90 95 LOO 105 

tat etc gtt get gcc gtc atg tct agt ttt ggt ate act tee atg get 44 9 
Tyr Leu Val Ala Ala Val Met Ser Ser Phe Gly lie Thr Ser Met Ala 
110 lis 120 



gtc atg get gtt tat tac agg ttc tgg tgg caa atg gag ggt gga gag 497 
Val Met Ala Val Tyr Tyr Arg Phe Trp Trp Gin Met Glu Gly Gly Glu 
125 130 135 

gtg cct tta get gaa atg ttt ggc aca ttt get etc tct gtt ggt get 545 
Val Pro I.eu Ala Glu Met Phe Gly Thr Phe Ala Leu Ser Val Gly Ala 
140 145 150 

20 

get gtg ggc atg gag ttt tgg gca cga tgg get cat aaa get etg tgg 593 
Ala Val Gly Met Glu Phe Trp Ala Arg Trp Ala His Lys Ala Leu Trp 
155 160 165 

cat get tct tta tgg cat atg cac gag tct cac cat cga cca aga gag 641 
His Ala Ser Leu Trp His Met His Glu Ser His His Arg Pro Arg Glu 
170 175 180 185 

ggt cct ttt gag eta aac gat gtg ttt gcc ata ate aac gca gtt cca 669 
Gly Pro Phe Glu X«eu Asn Asp Val Phe Ala lie lie Asn Ala Val Pro 
30 190 195 200 

gcc ata gcc ctt etc tct ttt ggc ttc ttc cac aaa ggc ctt gta cct 737 

Ala lie Ala Leu Lieu Ser Phe Gly Phe Phe His Lys Gly Leu Val Pro 

205 210 215 

35 

ggt etc tge ttt ggt get gga ctt ggc att aeg gtg ttt ggg atg gcc 785 

Gly Leu Cys Phe Gly Ala Gly Leu Gly lie Thr Val Phe Gly Met Ala 

220 225 230 

tac atg ttc gtc cac gat ggt etc gtt cac aaa agg ttc cct gtg ggt 833 
40 Tyr Met Phe Val His Asp Gly Leu Val His Lys Arg Phe Pro Val Gly 

235 240 245 

ecc att gcc gac gtg cct tat ttc egg aga gtc get gcg get cac eag 881 
Pro lie Ala Asp Val Pro Tyr Phe Arg Arg Val Ala Ala Ala His Gin 
250 255 260 265 



Ctt cac cac teg gat aaa ttc cac ggt gtt cca tat ggg etc ttt etc 929 
Leu His His Ser Asp Lys Phe His Gly Val Pro Tyr Gly Leu Phe Leu 
270 27S 280 

gga cct aag gag ctt gaa gaa gtg ggg gga eta gaa gaa ttg gag aag 977 
Oly Pro Lys Glu Leu Glu Glu Val Gly Gly Leu Glu Glu Leu Glu Lys 
285 290 295 
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gag ate agt aag aga ate aaa tea tac aac agg gtt cca aaa 1019 
Glu He Ser Lys Arg He Lys Ser Tyr Asn Arg Val Pro Lys 
300 305 310 

taatcaattt aatgggagga ccaatttttg gatcaatttg tcagtgtaca gaaacaatag 1079 

tgttattaat gaaaaaaata aattatgaat gcccatgggt ggattactgt tgtaaagttt 113 9 

atgatgttaa ataatatat 1158 

<210> 2 
<211> 311 
<212> PRT 

<213> Citrus unshiu 
<400> 2 

Met Ala Val Gly Leu I*eu Ala Ala He Val Pro Lys Pro Phe Cys Leu 
15 10 15 

Leu Thr Thr Lys Leu Gin Pro Ser Ser Leu Leu Thr Thr Lys Pro Ala 
20 25 30 

Pro Leu Phe Ala Pro Leu Gly Thr His His Gly Phe Phe Asn Gly Lys 
35 40 45 

Asn Arg Arg Lys Leu Asn Ser Phe Thr Val Cys Phe Val Leu Glu Glu 
50 55 60 

Lys Lys Gin Ser Thr Qln He Glu Thr Phe Thr Asp Glu Glu Glu Glu 

Glu Ser Gly Thr Gin He Ser Thr Ala Ala Arg Val Ala Glu Lys Leu 
85 90 95 

35 Ala Arg Lys Arg Ser Glu Arg Phe Thr Tyr Leu Val Ala Ala Val Met 

100 105 110 

Ser Ser Phe Gly He Thr Ser Met Ala Val Met Ala Val Tyr Tyr Arg 
lis 120 125 

40 

Phe Trp Trp Gin Met Glu Gly Gly Glu Val Pro Leu Ala Glu Met Phe 
130 135 140 

Gly Thr Phe Ala Leu Ser Val Gly Ala Ala Val Gly Met Glu Phe Trp 
145 150 155 160 

45 

Ala Arg Trp Ala His Lys Ala Leu Trp Hie Ala Ser Leu Trp His Met 
165 170 175 

His Glu Ser His His Arg Pro Arg Glu Gly Pro Phe Glu Leu Asn Asp 
so 180 185 190 

Val Phe Ala He He Asn Ala Val Pro Ala He Ala Leu Leu Ser Phe 
195 200 205 

55 
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Gly Phe Phe His Lys Gly Leu Val Pro Gly Leu Cys Phe 61 y Ala Gly 
210 215 220 

Leu Gly He Thr Val Phe Gly Met Ala Tyr Met Phe Val His Asp Gly 
225 230 235 240 

Leu Val His Lys Arg Phe Pro Val Gly Pro He Ala Asp Val Pro Tyr 
245 250 255 

Phe Arg Arg Val Ala Ala Ala His Gin Leu His His Ser Asp Lys Phe 
260 265 270 

His Gly Val Pro Tyr Gly Leu Phe Leu Gly Pro Lys Glu Leu Glu Glu 
275 280 285 

Val Gly Gly Leu Glu Glu Leu Glu Lys Glu He Ser Lys Arg He Lys 
290 295 300 

Ser Tyr Asn Arg Val Pro Lys 
305 310 



<210> 3 
<211> 8 
<212> PRT 

25 <213> Artificial Sequence 



<220> 

<223> Peptide designed based on the highly conserved 
amino acid secjuence of carotene hydroxylase 

<400> 3 

Phe Glu Leu Asn Asp Val Phe Ala 
1 5 



35 <210> 4 

<211> 6 



<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Peptide designed based on the highly conserved 
amino acid sequence of ^-carotene hydroxylase 



<400> 4 

His Asp Gly Leu val His 

45 15 



<210> S 
<211> IB 
<212> DNA 

<213> Artificial Sequence 



<220> 
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75 



20 



25 



<223> Synthetic DNA 
<400> 5 

ttygarctaa aygaygtn 28 



<210> € 
<2ll> 17 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Synthetic DNA 
<400> 6 

cacgayggtc tngtnca , 17 



<210> 7 
<2ll> 45 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Synthetic DNA 



<400> 7 

"ao ■ aactggaaEga" attcgcggc<r g^aCgga'attt^tttttttttt "tlftrtt: 45 

<210> 8 
<2ll> 21 
35 <212> DNA 

<213> Artificial Sequence 

<220> 

<223> Synthetic DNA 

40 

<400> 8 

tggaagaatt cgcggccgca g 21 

45 

Claims 

50 1 . A polypeptide which has p-carotene hydroxylase activity and which comprises an amino acid sequence selected 
from: 

(i) the amino add sequence of SEQ ID NO:2: 

(ii) an amino acid sequence which is at least 70% homologous to the amino acid sequence of SEQ ID NO:2; 
55 and 

(iii) a fragment of sequence (i) or (ii) which is not a said sequence (ii). 

2. A polypeptide according to claim 1 which is a reconlbinant protein selected from: 
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(a) a protein consisting of the amino acid sequence of SEQ ID NO:2; and 

(b) a protein which consists of the amino add sequence of SEQ ID NO:2 having deletion, substitution or addi- 
tion of one or several amino acids. 

5 3. A polynucleotide coding for a polypeptide as defined in claim 1 or 2. 

4. A polynucleotide according to claim 3 which is a DNA. 

5. A polynucleotide according to claim 4, which comprises the coding portion of the nucleotide sequence of SEQ ID 
10 NO:l. 

6. A recombinant vector comprising a polynucleotide as defined in any one of claims 3 to 5. 

7. A transformant which is transformed with a vector as defined in claim 6. 

IS 

8. A method for preparing p-carotene hydroxylase and/or p-cryptoxanthin, which method comprises maintaining a 
transformant as defined in claim 7 under conditions such that the desired p-carotene hydroxylase and/or p-cryptox- 
anthin is expressed and recovering the expressed p-carotene hydroxylase and/or p-cryptoxanthln. 

20 9. A method for preparing a p-carotene hydroxylase according to claim 8, comprising culturing the transformant in a 
medium and recovering the p-carotene hydroxylase from the resultant culture. 

10. A method for preparing p-cryptoxanthin according to claim 8, comprising culturing the transformant in a medium 
and recovering p-cryptoxanthin from the resultant culture. 
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FIG. 2 
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